Publication and use of large data sets
نویسنده
چکیده
Scientific information comes in many sizes, types and levels of quality. Because of the diversity of the scientific information being published, different issues will arise in publishing different types of information electronically. In this paper, we will address issues related to electronic publication of large scientific data sets, a subset of scientific information often overlooked in discussions on electronic scientific publications. First, we establish the parameters that define large scientific data sets. Then we identify examples from a variety of scientific disciplines. Large data sets (LDS) require special technology for their creation and management, and that technology is briefly described, as well as traditional publication and use of LDS. We discuss electronic publication of large scientific data sets and their uses as they exist today. Finally, we look into the future of electronic publication of LDS, including issues such as intellectual property rights (IPR), and LDS as a source of new discovery and economics.
منابع مشابه
Application of Benford’s Law in Analyzing Geotechnical Data
Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets, following Benford’s law, numbers are started with a small leading digit more often than those with a large leading digit. This law can be used as a tool for detecting fraud and abnormally in the number sets and any fabricated number sets. This can be used as an ...
متن کاملSpatial Design for Knot Selection in Knot-Based Low-Rank Models
Analysis of large geostatistical data sets, usually, entail the expensive matrix computations. This problem creates challenges in implementing statistical inferences of traditional Bayesian models. In addition,researchers often face with multiple spatial data sets with complex spatial dependence structures that their analysis is difficult. This is a problem for MCMC sampling algorith...
متن کاملکاربرد روشهای شناسایی تورش انتشار برای فراتحلیل در ارزیابی تاثیر داروی آلبندازول در درمان مبتلایان به آسکاریس و تریکوسفال
Background : Meta analysis is a statistical method to combine the findings of a set of large number of published individual studies and re-analyse them. The use of meta-analysis methods in medical research has been increased, noticeably, in resent years. However, one of the major shortcomings in such analysis is that the researcher, could not access all conducted studies in the area of concern...
متن کاملMisconduct in Research and Publication
Dear Editor, I read the recent publication on “Misconduct in Research and Publication” with great interest[1]. I agree that misconduct in research and publication is not uncommon. Nevertheless, it is rarely mentioned. In fact, there are many incorrect conceptions among researchers on publication ethics. The milder examples are attempts to report only the “positive outcomes&rdq...
متن کاملExamining University Students' Scholarly Publication in English Journals: A Case for Postgraduate Students' Written Literacy Practices
This research aimed to screen 'essay writing' difficulties that non-native university students at postgraduate levels usually experience regarding scholarly publication in mainstream, English journals. Two sets of variables including written literacy competencies in Persian and English languages were mapped over language uses (General vs. Academic). Initial screenings from ...
متن کاملSelection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets
Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...
متن کامل